Efficient Coflow Scheduling Without Prior Knowledge — Public Review
نویسنده
چکیده
A lot of blood, sweat and tears have been shed in the quest to improve network performance, mostly in terms of flow completion times. But this race to the top has meant that we have been guilty of forgetting what really matters — application performance. Applications have different notions of the utility they derive from flow completion, so determining the right network metric to optimize is a tricky proposition. In 2012, Chowdhury and Stoica proposed the Coflow abstraction to concisely capture application-level performance semantics at the network layer. A " coflow " refers to a set of flows with a collective goal. For example, the all-or-nothing property of data-parallel jobs entails that coflow completion is what a job really cares about, not when individual flows within a coflow finish. This is a very neat observation! The coflow abstraction leads to the coflow scheduling problem, i.e., how should network traffic be scheduled to minimize coflow completion times? This is an NP-hard problem. Two solutions were proposed at Sigcomm 2014, Varys and Baraat. Varys reduces coflow completion times by scheduling them in order of their total size. This relies on a tightly-coupled centralized design and assumes flow sizes are known a priori. Instead, Baraat adopts a FIFO-like scheduling policy that lends itself to decentralized implementation but at the expense of performance. This paper addresses the shortcomings of these past efforts. It presents a non-clairvoyant coflow scheduling policy which does not require knowledge of flow sizes, while not giving up on any performance to avoid head-of-line blocking. The main idea is to approximate the least-attained service (LAS) scheduling. Coflows start in the highest priority queue but are gradually demoted to lower priority as they send more bytes. Determining a coflow's current size necessitates a coordination point. However, the priority levels are discrete which means the coordination can be done loosely. The authors also describe Aalo, a system that implements such coflow scheduling. The evaluation, performed with EC2 experiments and simulations, shows that Aalo's non-clairvoyant scheduling performs close to schedulers with complete information about coflows. The reviewers agreed that Aalo represents the natural next step in coflow scheduling. They appreciated that the design is simple yet it accounts for several practical details, and it is thoroughly evaluated. By not requiring any a priori information about coflows, Aalo achieves efficient scheduling in spite of operational dynamics due to multi-stage jobs, multi-wave scheduling, failures and speculative …
منابع مشابه
On Scheduling Coflows
Applications designed for data-parallel computation frameworks such as MapReduce usually alternate between computation and communication stages. Coflow scheduling is a recent popular networking abstraction introduced to capture such application-level communication patterns in datacenters. In this framework, a datacenter is modeled as a single non-blocking switch with m input ports and m output ...
متن کاملAdia: Achieving High Link Utilization with Coflow-Aware Scheduling in Data Center Networks
Link utilization has received extensive attention since data centers become the most pervasive platform for data-parallel applications. A specific job of such applications involves communication among multiple machines. The recently proposed coflow abstraction depicts such communication through a group of parallel flows, and captures application performance through corresponding communication r...
متن کاملMulti-hop Coflow Routing and Scheduling in Data Centers
Communication in data centers often involves many parallel flows that all share the same performance goal. A useful abstraction, coflow, is proposed to express the communication requirements of prevalent data parallel paradigms. The multiple coflow routing and scheduling problem faces challenges when deriving a good theoretical performance ratio because coexisting coflows will compete for the s...
متن کاملDelft University of Technology A Coflow-based Co-optimization Framework for High-performance Data Analytics
Efficient execution of distributed database operators such as joining and aggregating is critical for the performance of big data analytics. With the increase of the compute speedup of modern CPUs, reducing the network communication time of these operators in large systems is becoming increasingly important, and also challenging current techniques. Significant performance improvements have been...
متن کاملExperimental Analysis of Algorithms for Coflow Scheduling
Modern data centers face new scheduling challenges in optimizing job-level performance objectives, where a significant challenge is the scheduling of highly parallel data flows with a common performance goal (e.g., the shuffle operations in MapReduce applications). Chowdhury and Stoica [6] introduced the coflow abstraction to capture these parallel communication patterns, and Chowdhury et al. [...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015